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Abstract 

Enumerating all solutions of a relational algebra equation is a nat- 
ural and powerful operation which, when added as a query language 
primitive to the nested relational algebra, yields a query language for 
nested relational databases, equivalent to the well-known powerset al- 
gebra. We study sparse equations, which are equations with at most 
polynomially many solutions. We look at their complexity, and com- 
pare their expressive power with that of similar notions in the powerset 
algebra. 



1 Introduction 

Suppose we are allowed to see only a view on a database B, computed by 
a relational algebra expression e. If we still want to find out what B is, 
we might try to "invert" e (assuming we know this expression), which will 
only work when we also know the finite domain D of B. Specifically, we can 
enumerate all databases X over D, and test for each X whether it satisfies the 
equation e(X) = e(B). One of these solutions will be B of course, so if the 
set of all solutions is not too big, it might provide us with useful information 
to start our detective work. 

The above simple scenario from database security led us to wonder what 
can be said in general about the solution of equations in the relational algebra. 
Generally, if e\ and e<i are two algebra expressions over some database schema 
augmented with some relation variables X\ y . . . , X p , we can consider the 
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equation e± = e2- A solution of this equation, given a database B with finite 
domain D, is a tuple (Xx, . . . ,X p ) of relations over D such that e\ and e<i 
evaluate to the same relation on the augmented database (B, X\, . . . , X p ). 

Asking whether there exists a solution of a relational algebra equation on 
a database is almost exactly the same thing as asking whether an existential 
second-order logic sentence is true on that database. Hence, by Fagin's theo- 
rem |Fag74 EF95 , the problems that can be formulated as finding a solution 
of some relational algebra equation are nothing but the problems in NP. 

However, in the present paper, we start from the observation that the 
set of all solutions of an equation, being a set of tuples of relations, is a 
nested relation. One can therefore consider the enumeration of all solutions 
of an equation as a query language primitive, which can be added to the 
nested relational algebra. We introduce and study this extension of the 
nested relational algebra, which we call the equation algebra. The equation 
algebra is extremely powerful: it is equivalent to the well-known powerset 
algebra for nested relations. Our particular interest, however, is in what can 
be expressed in the equation algebra by using only equations that have a 
solution set of polynomial size on each database. We call such equations 
sparse. 

Our interest in sparse equations does not stem from time efficiency con- 
siderations. Indeed, it is not obvious how knowing that an equation is sparse 
would help in actually finding even one solution more quickly. It is neither 
obvious, however, that it would not help. For example, consider the prob- 
lem of checking on a given database whether some fixed sparse relational 
algebra equation has a solution. Using an extension of Fagin's theorem to 
nested relational databases, we show that this problem can be NP-hard only 
if every problem in NP can already be decided by a polynomial-time non- 
deterministic Turing machine that has only polynomially many accepting 
computations on each input. The latter is one of the many unresolved ques- 
tions in computational complexity theory [A"I186j. 

Nevertheless, sparse equations are still interesting from a space efficiency 
standpoint. Indeed, for the natural evaluation strategy for equation algebra 
expressions to run in polynomial space, it is necessary that all equations 
occurring in the expression are sparse. Interest in fragments of powerful 
query languages for which the natural evaluation strategy is polynomial-space 
is not new to database theory research. For example, Abiteboul and Vianu 
[AV91J showed that the parity query is not expressible in the polynomial- 
space fragment of various computationally complete query languages. 
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Closer to our topic is the work of Suciu and Paredaens |SP97j , who showed 
that queries such as transitive closure and parity are not expressible in the 
polynomial-space fragment of the powerset algebra for nested relations. This 
fragment consists of all powerset algebra expressions where all intermediate 
results are of polynomial size, on each database. Note that this fragment does 
make sense as there are expressions that always produce a result of logarith- 
mic size; applying the, exponential, powerset operator to such expressions 
produces a result of polynomial size. 

We also mention Grumbach and Vianu |GV95j . who also studied a spar- 



sity notion in connection with queries over nested relational databases, al- 
though they considered sparsity as a property of databases rather than of 
query language expressions. 

Suciu and Paredaens conjectured in general that the polynomial-space 
fragment of the powerset algebra has no more power than the nested rela- 
tional algebra without powerset. (This conjecture has been confirmed for 
monadic database schemas [VdBj.) At first sight, the operator that we add 
to the nested relational algebra, to enumerate all solutions of an equation, 
does not seem to be that different from the powerset operator. After all, 
both operators perform some kind of potentially exponential enumeration. 

Yet, as we will point out, the analogue of the Suciu-Paredaens conjecture 
does not hold for the sparse fragment of the equation algebra. Specifically, 
using sparse equations only, we can express transitive closure; in fact, we 
can express any fixpoint query. This complements a result by Abiteboul and 
Hillebrand A H95j . who showed that transitive closure becomes expressible 
in the powerset algebra in polynomial space, provided we use a more clever 
"pipelined" evaluation strategy. Actually, every fixpoint query is already 
expressible using equations that are not just sparse, but even unambiguous: 
they have a unique solution on each database. Unambiguous equations in 
the relational algebra are known as implicit definitions in first-order logic, 
and were studied in the context of finite model theory by Kolaitis |Kol90j . 
Kolaitis already showed that every fixpoint query can be implicitly defined. 
We offer a straightforward, direct proof. 

Another example of the differences between the sparse fragment of the 
equation algebra and that of the powerset algebra is given by the well-known 
nesting operator of the nested relational algebra. This operator becomes 
redundant once we extend the algebra with the solution operator or with 
the powerset operator. However, the original nesting operator never blows 
up exponentially. We show that, without using the nesting operator itself, 
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nesting is expressible in the equation algebra using sparse equations only, 
but that the same is not possible with a polynomial-space powerset algebra 
expression. 

However, there are also similarities between the two fragments. Specifi- 
cally, we prove an analogue to the Suciu-Paredaens result, to the effect that 
the parity query is not expressible in the sparse fragment of the equation 
algebra either. This is our main technical contribution; the proof is in the 
style of an elegant argument of Liebeck |Lie83j . invoking Bochert's theorem 
on the order of primitive permutation groups. Coming back to connection 
with implicit definitions in first-order logic, our result generalizes the known 
and easy fact that the parity query cannot be defined implicitly Kol90j, in 
two directions: from unambiguous to sparse; and from a single equation to an 
arbitrarily complex expression involving several, possibly nested, equations. 

This paper is organized as follows. Section |21 recalls the nested relational 
data model. Section |3] introduces relational algebra equations. Section |U in- 
troduces the equation algebra. Section El introduces sparse equations, as well 
as the natural evaluation strategy for equation algebra expressions. Sectional 
studies the time complexity of sparse equations. Finally, Section [7| presents 
the comparison with the polynomial-space powerset algebra. 

2 Preliminaries 

We quickly recall the nested relational data model and algebra |TF86| [AHV95 . 

Relation types are defined as follows. The symbol is a type; and, if T\, 
. . . , Tk are types, then so is (rj, . . . , 7*). For a type r, and some set D of 
atomic values, the relations of type r on D are inductively defined as follows. 
A relation of type on D is just an element of D (this serves merely as the 
base case for the induction). A relation of type (n, . . . , Tk) on D is a set of 
/c-tuples (x\, . . . , Xk) such that Xi is a relation of type on D, for i — 1, . . . , k. 

A database schema is a finite set S of relation names, where each relation 
name has an associated type different from 0. A database B over S consists 
of a non-empty finite domain D of atomic values, together with, for each 
relation name R in S, a relation R B of type r on D, where r is the type of 
R. 

The operators of the nested relational algebra are those of the standard 
relational algebra (union U and difference — of relations of the same type; 
cartesian product x; projection 7r; selection a for equality, which can now be 
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set equality of nested relations), plus the operators nesting v and unnesting 
//, defined as follows. 

Let R be a relation of type (t±, . . . , T k ), and let ii, . . . , i p G {1, . . . , k}. 
Then the nesting Vi lt ...,i p (R) equals the relation 

[(xi,...,x k ,{(y h ,...,y ip ) | (yi,...,y k ) e R 

and Xj = yj for each j e {1, . . . , k} — . . . , i p }}^j 

(x ± , . . . ,x k ) G i?| 

of type (n, . . .,T k , (r h ,. . .,T ip )). 

Let R be as in the previous paragraph, and let % e {1, . . . , A;} such that 
r, 7^ 0; so Tj is of the form (a>i, . . . , loi). Then the unnesting fii(R) equals the 
relation 

{(xi, . . . , a; fc , yi, . . . , y e ) \ (x 1: . . . , x k ) e R and (y 1: . . . , y t ) e x { } 

of type (ri, . . . ,r fc ,wi, . . . 

The expressions of the nested relational algebra over a schema S are now 
built up using the above operators from the relation names in S and the 
symbol D, which stands for the finite domain of the input database. The 
relation to which an expression e evaluates on a database B is denoted by 
e(B). 

One can extend the nested relational algebra to the powerset algebra by 
adding the powerset operator, defined as follows. Let R be a relation of type 
(ti, . . . , T k ). Then the powerset equals the relation {S \ S C R} of type 
((Ti,...,T fc )). 



3 Equations 

Let S and <Y be disjoint database schemas; S is the actual database schema, 
while X is thought of as a set of additional relation variables. Let e\ and e2 
be two expressions over the expanded schema S U X. 

Definition 3.1. Given a database B over S, a solution to the equation e\ = 
e2 is a database A over X with the same finite domain as B, such that 
e\{B,A) — e 2 (B,A). 
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Here, (B,A) denotes the expansion of B with A, i.e., the database over 
S U X that has the same finite domain as B, that equals B on S, and that 
equals A on X. 

Example 3.2. For a very simple example, let R G S and let X = {X}, where 
X has the same type as R. Then XUR = R is an equation. Given a database 
B over 5, a database A over {X} is a solution if and only if X A C 

For another example, let X be a relation variable of type (0,0). One 
can write a relational algebra expression e such that on any database A over 
{X} with finite domain D, e(A) is empty if and only if X A is one-to-one, the 
projections ni(X A ) and 7r 2 (X A ) are disjoint, and their union equals D. An 
example of an e that works is 

7TlO- 2 ^4(Xi = 3 (X X X) U 7^0-2=4(71^3 (X X X) 

U ( 7 r 1 (X)-(7r 1 (X)- 7 r 2 (X))) 

U(D- (ttxPQ U tt 2 (X))) U ((7n(X) U tt 2 (X)) - £>) 

Then the equation e = has a solution on a database 5 with finite domain 
D if and only if the cardinality of D is even. (Technically, e = is not an 
equation because the symbol is not an expression, but we can easily take 
here to stand for the expression D — D which always evaluates to the empty 
relation.) □ 

Remark 3.3. In the above example, we used an equation of the special form 
e = 0. Actually, this form is not so special at all, because any equation 
ei = e 2 can be brought in this form as e\ A e 2 = 0, where e\ A e 2 stands for 
(ei — e 2 ) U (e 2 — ei) (symmetric difference). 

Alternatively, one might wonder about the use of dis equations, of the 
form e ^ 0. These are nothing but equations in disguise, because they can 
also be written as iri(D x e) = D. Conversely, any equation e 1 = e 2 can also 
be written as the disequation D — ir^D x (ei Ae 2 ))^0. □ 

4 The equation algebra 

We are now ready to extend the nested relational algebra with a solution 
operator for equations. We refer to the resulting algebra as the equation 
algebra. 

To allow for an elegant definition, we do not fix a schema S in advance. 
Rather, we assume a sufficiently large supply of relation names of all possible 



6 



types. Any relation name can now occur in an expression. Like in logic 
formulas, some will occur free and others will occur bound. Bound relation 
names are bound by the solution of an equation, and serve as the variables 
of the equation. Within the equation, however, they are still free. We denote 
the set of relation names that occur free in an equation algebra expression e 
by free(e). 

For the constructs of the nested relational algebra, this is all straightfor- 
ward: for a relation name R, we have free(R) := {R}; for expressions e of 
the form (eiUe2), (ei — e 2 ), or (eixe2), we have /ree(e) := /ree(ei) U/ree(e 2 ); 
for expressions e of the form cr(e'), vr(e'), v(e'\ or /i(e'), we have freeze) : = 
freeze'). For the expression D, we have free(D) := 0. 

The definition of the new solution operator is now the following: 

Definition 4.1. Let e\ and €2 be expressions, and let Xx, ■ ■ ■ ,X p be a se- 
quence of distinct relation names. Then 

{(Xx, . . .,X P ) I ex = e 2 } 

is also an expression (called a solution expression). We define its free set as 
(/ree(ei) U/ree(e 2 )) — {Xx, • • • ,X P }. We say that the Xi become bound. 

Note that this is a recursive definition, in the sense that e\ and e 2 can 
contain solution operators in turn. To avoid clutter, we disallow equation 
algebra expressions in which a free relation name at the same time becomes 
bound in some subexpression, as in X x {(X) | X U R = R}. 

An expression e in the equation algebra can be evaluated on databases B 
over any schema that contains free(e). We already know how this evaluation 
is defined for the constructs of the nested relational algebra. So we only have 
to give 

Definition 4.2. For a solution expression, e, of the form {(Xx, . . . , X p ) | 
ei = e 2 }, and a database B, the evaluation e(B) equals the relation 

{(X±, . . . , X^ ) I A is a database over {Xx, . . . , X p } 

that is a solution of ex = e 2 , given B}. 

This relation is of type (tx, ■ ■ ■ , t p ) , where Tj is the type of Xi for i = 1, . . . , p. 

Example 4.3. Recall the simple example equation X U R = R from Exam- 
ple 13.21 We can turn this equation in the following equation algebra expres- 
sion e: {(X) I X U R = R}, or, more readibly, {(X) \ X C R}. Note that 
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freeze) = {R}. On any database B over {R}, the relation e(B) equals U(R B ) 
(recall the powerset operator II from Section |2J. In other words, the equation 
algebra expression e is equivalent to the powerset algebra expression U(R). 

The equation algebra allows equations to be used inside equations. For 
example, if we want to compute the powerset of the powerset of R, we can 
write: 

{(Y)\YC{(X)\XCR}}. 

As a third example, let R and T be relation names of the same binary type 
(r, r) for some r. One can write a relational algebra expression e tc such that 
on any database C over {R, T}, e tc is empty if and only if R c C T° and T° is 
transitively closed. One can also write a nested relational algebra expression 
e m i n that selects, out of a set of binary relations, the minimal ones w.r.t. set 
inclusion. Explicit forms for e tc and e m i n have been given by Gyssens and 
Van Gucht |GVG| . Then the following equation algebra expression computes 
the transitive closure of relation R: 

7T2,3/iie m in({(T) | e tc = 0})- 

Indeed, the subexpression {(T) | e tc = 0} returns the collection of all transi- 
tively closed relations on the same domain as R and containing R; applying 
e m i n to that collection results in the singleton consisting of the minimal el- 
ement, i.e., the transitive closure of R (by definition of transitive closure); 
applying unnesting /ii produces the actual tuples in the transitive closure, 
keeping the nested relation (cf. our definition of the effect of \i in Section EJ); 
and applying 7r 2j 3 finally removes the nested relation. □ 

In the above example we saw that the powerset operator is expressible in 
the equation algebra. Conversely, the solution operator is easily expressed in 
the powerset algebra. Hence, 

Proposition 4.4. The equation algebra is equivalent to the powerset algebra. 

Proof. To see that {(X%, . . . , X p ) \ e± = e 2 } can be expressed in the powerset 
algebra, we begin by noting that for any relation type r one can write a 
powerset algebra expression IT yielding the collection of all relations of type 
r on D. For example, IT^ ) is U(D x D), and iT ^ is U(D x 11(D)). 
Hence, if the type of Xi is Tj for i = 1, . . . , p, then n T1 x ■ • • x U Tp yields 
the collection of all potential solutions. Now it suffices to observe that one 
can write nested relational algebra expressions that apply t\ or e 2 to each 
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database in this collection separately. Explicit forms of such expressions 
have been given by Gyssens and Van Gucht |GV Gj. After that, the actual 
solutions can be selected by an equality selection. □ 



5 Sparse equations 

So far, the equation algebra is merely another syntax for the powerset algebra, 
or, if you want, higher-order logic. However, when we consider a natural 
evaluation strategy for equation algebra expressions, we start to notice some 
differences. 

By the natural strategy to evaluate a solution expression of the form 
{{Xi, . . . , X p ) | e\ = €2}, we mean the following. Enumerate all databases A 
over {Xi, . . . , X p }, on the finite domain of the given input database, one by 
one, reusing the same space. For each A we test whether it is a solution (by 
recursively evaluating e\ and e-i), and if so, we include it in the result. 

For the constructs of the nested relational algebra, the natural evaluation 
strategy is clear: if we have to evaluate an expression of the form c\Qe2, with 
G {U, — , x}, we create two intermediate results by recursively evaluating 
e\ and e<i, and then apply to these two intermediate results. Similarly, 
if we have to evaluate an expression of the form 9(e), with 9 6 {n, a, v, fi} 
(and parameters added in subscript), we create an intermediate result by 
recursively evaluating e, and then apply 9 to this intermediate result. 

In view of this natural evaluation strategy, we now propose 

Definition 5.1. An equation is called sparse if all its relation variables are 
of flat type, i.e., of type of the form (0, . . . , 0), and the number of solutions 
on any given database is at most polynomial in the size of that database. 

Example 5.2. The two equations from Example IH.2l are not sparse. Probably 
the simplest example of a non-trivial sparse equation is the following. Let X 
be a relation name of type (0). One can write a relational algebra expression 
e over {X} such that on any database A over {X}, e(A) is empty if and 
only if X A is a singleton. Then the equation e = 0, where X is taken as 
the relation variable to be solved for, is sparse. Indeed, given any database 
B with finite domain D, the solutions are precisely all singleton subsets of 
D. There clearly are only a linear (and thus at most polynomial) number of 
possible solutions. □ 
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Remark 5.3. A natural alternative definition of sparsity would be the one 
where, in Definition 15.11 we would look only at databases over the schema 
consisting of the relation names that actually occur free in the equation. One 
easily sees, however, that this alternative definition yields the same notion 
of sparsity. □ 

Sparse equations are connected to the natural evaluation strategy in the 
following way: 

Proposition 5.4. The natural strategy to evaluate an equation algebra ex- 
pression e runs in polynomial space, if and only if all equations occurring in 
e are sparse. 

Here, we count not only the space occupied by the intermediate results 
stored during evaluation, but also the size of the final result. 

Proof. The if-direction is clear. For the only-if direction, we work by induc- 
tion on the nesting depth of equations. The base case — expressions that do 
not contain any equations at all — is trivial. 

For the inductive step, consider a top-level equation {(X\, . . . , X p ) \ e.\ = 
ei} occurring in e. The natural strategy to evaluate this equation runs in 
polynomial space, so in particular, for each expansion of each database B over 
free(e) with a candidate solution A over {Xi, . . . , X p }, the natural evaluation 
of ei and e^ on (B, A) runs in polynomial space. In this way we consider 
every possible database C over free(e)U{X 1 , . . . , X p }, because the restriction 
of C to freeze) is a possible B, and C itself then is a possible expansion of 
B. Hence, the natural evaluation strategies of e\ and C2 in general run in 
polynomial space. 

Formally, we must note here that e± and C2 might not actually mention 
certain relation names in free(e) or . . . ,X p }, and that there is still the 
formal possibility that their natural evaluation might not run in polynomial 
space on databases over schemas not containing these names. However, using 
Remark 15.31 it can be seen that this is impossible. 

By induction, we can therefore conclude that all equations occurring 
nested inside a top-level equation are sparse. 

The top-level equation itself must also be sparse. In proof, if one of the 
Xi would be of non-flat type, even one candidate solution can already be of 
exponential size. Indeed, even in the simplest case where Xi would be of type 
((0)), on a domain with n elements, a possible candidate value for Xi is the 
collection of all subsets of that domain, which is of size 2 n . So, every X t is 
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of flat type. Furthermore, since we store the solution set as an intermediate 
result, it must be of at most polynomial size on all databases over /ree(e). 
Since the individual solutions are flat databases and thus of polynomial size, 
the cardinality of the solution set must therefore be at most polynomial. □ 

6 Time complexity of equation nonemptiness 

The time complexity of solving sparse equations is closely linked to an open 
question from computational complexity theory. Unlike the previous section, 
in this section we are not talking about the natural evaluation strategy, whose 
time complexity is clearly at least exponential as soon as there are equations 
to be solved. 

Instead, we will be looking at the time complexity of the nonemptiness 
problem of equations. The nonemptiness problem of an equation over a 
schema S with relation variables X is the problem of deciding, given a 
database over S, whether the equation has a solution on that database. 
In the present section we will only consider equations that do not contain 
equations inside. 

Let us begin by considering equations that are not necessarily sparse, 
but that still have only flat variables. The nonemptiness problem of such 
a flat- variable equation is clearly in NP. Now suppose, moreover, that the 
database schema S is also flat; then S U X (the expansion of S with the 
relation variables of the equation) is an entirely flat schema. Of course, 
the equation e\ = C2 is still in general in the nested relational algebra, i.e., 
ei and €2 can contain v and \x operators. A result by Paredaens and Van 
Gucht |PVG92j , however, implies that the nested relational algebra condition 
e\ = €2 can also be expressed in the form e ^ 0, with e a flat relational 
algebra expression. The nonemptiness problem of the equation thus amounts 
to asking whether {(Xx, ■ ■ ■ ,X p ) \ e 7^ 0} is nonempty on a given database 
B over S. Equivalently, we ask whether the existential second-order logic 
(3SO) sentence 3Xl . . . 3X p <p e is true on B, where <p e is a first-order logic 
sentence expressing that e ^ 0. Moreover, by the equivalence of relational 
algebra and first-order logic, any 3SO property can be obtained in this way. 
Now, Fagin's theorem |Fag74[ IEF95j states that 3SO captures exactly the 
NP properties of flat relational databases. Hence, the class of nonemptiness 
problems of flat-variable equations over flat database schemas is exactly the 
class of NP properties of flat relational databases. 
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What if S is not necessarily flat? We next show that we still get exactly 
NP. In essence, this is an extension of Fagin's theorem to nested relational 
databases. 

Proposition 6.1. Every property of nested relational databases over some 
fixed schema S, that is in NP and closed under isomorphism, corresponds to 
the nonemptiness problem of some flat-variable equation over S. 

Proof. The crux is a representation of nested relational databases by "pseudo- 
flat" ones, also used by Gyssens, Suciu, and Van Gucht GSVG01 . Given 



a nested relational database B, we define its extended domain, denoted by 
edom(.B), as the union of its finite domain of atomic values with the set of 
all relations occurring (possibly deeply nested) in B. We regard the relations 
in the extended domain as if they were atomic values. Now for any nested 
relational database schema S we can construct a flat one S, together with 
a mapping rep from the set of databases over S onto the set of databases 
over S, expressible in the nested relational algebra. The details of this map- 
ping need not concern us here. Important is that we can furthermore con- 
struct a converse mapping flat from the set of databases over S to the set 
of databases over S, also expressible in the nested relational algebra, with 
the following properties for each database B over S: (1) the finite domain of 
flat(B) equals edom(£>); and (2) rep(flat(B)) = B. Note that while flat is 
expressed in the nested relational algebra, the result flat(B) is not really a 
flat database, because of the nested relations in the extended domain. How- 
ever, it is "pseudo-flat," in the sense that we regard these relations as if they 
were atomic values. For any relation name R of S, we denote the nested 
relational algebra expression defining the i?-component of the mapping flat 
by flat R . Likewise, we denote the expression defining the D-component by 
flat D . 

Given this representation, the proof is straightforward. Let L be an NP 
property of databases over S, closed under isomorphism. Define the property 
L of databases over S as follows: F satisfies L if rep(F) satisfies L. Then L 
is in NP, and is also closed under isomorphism. Hence, Fagin's theorem gives 
us an 3SO sentence 3Xi . . . 3X p (p over S expressing L. By the equivalence 
of relational algebra and first-order logic, there is a flat relational algebra 
expression e over S U {Xi, . . . ,X p } such that the first-order logic sentence 
ip is equivalent to e ^ 0. Now modify e as follows: for every relation name 
R of <S, replace every occurrence of R in e by flat R . Likewise, replace every 
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occurrence of D in e by flat D . Denote the resulting nested relational algebra 
expression by e'. 

We now have, for any database B over S, that B satisfies L if and only if 
3X% . . .X p e' 7^ is true on B. The condition e' 7^ can easily be written 
as an equation (cf. Remark [3. 3|) . □ 

We are now ready to turn to sparse equations. Their nonemptiness prob- 
lem is not just in NP, but actually in the complexity class FewP |A1186j . 
consisting of all problems that can be decided by a polynomial-time non- 
deterministic Turing machine that has at most polynomially many accepting 
computations on each input. Clearly, P C FewP C NP, but the strictness of 
these inclusions remains open. 

The obvious question to ask is whether Proposition 16. ll remains true if we 
focus on sparse equations, and replace 'NP' by 'FewP.' The answer is an easy 
"yes," but then we must restrict attention to ordered databases: databases 
that include a total order on their finite domain as one of their relations. 

Proposition 6.2. Every property of ordered nested relational databases over 
some fixed schema S, that is in FewP and closed under isomorphism, corre- 
sponds to the nonemptiness problem of some sparse equation over S, when 
restricted to ordered databases only. 

Proof. The usual proof of Fagin's theorem immediately yields the case where 
S is flat. Indeed, in that proof, to express an NP property decided by some 
polynomial-time bounded non-deterministic Turing machine M, one writes 
an 3SO sentence 3X\3X 2 . . . 3X P tp where X% stands for an order on the 
domain; X 2 , . . . , X p encode (using the order in Xx) a computation of M; 
and if checks whether the computation is accepting. As we are dealing with 
a FewP property, M has only polynomially many accepting computations. 
Hence, the equation {(Xi, . . . , X p ) \ ip} would be sparse were it not for X±, as 
there are exponentially many possible orders on a finite domain. On ordered 
databases, however, there is no need for X\ and we obtain a genuinely sparse 
equation. 

This is for flat databases; for general nested relational databases we use 
the same representation technique as in the proof of Proposition 16. II □ 

As a corollary we get 

Corollary 6.3. There exists a sparse equation whose nonemptiness problem 
is NP-complete, if and only if FewP = NP. 
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7 Sparse equations versus sparse powerset ex- 
pressions 

Naturally, we call an equation algebra expression sparse if all equations oc- 
curring in it are sparse. Inspired by Proposition 15.41 we can also define a 
sparsity condition on powerset algebra expressions: call a powerset algebra 
expression sparse if its natural evaluation strategy (defined in the obvious 
way) runs in polynomial space. 

Remark 7.1. Using standard techniques one can show that sparsity is unde- 
cidable, for equation algebra expressions as well as powerset algebra expres- 
sions. An interesting question, raised by an anonymous referee, is whether 
one can give useful syntactic restrictions that guarantee sparsity. Ideally 
every sparse expression would be equivalent to one satisfying the syntactic 
restrictions. □ 

Suciu and Paredaens [SP97J showed that transitive closure of a flat binary 
relation is not expressible by a sparse powerset expression. In Example 14. H| 
we gave an obvious equation algebra expression for transitive closure, but 
that expression was not sparse. We can do better: 

Proposition 7.2. Transitive closure of a flat relation is expressible by a 
sparse equation algebra expression. 

Proof. Given a binary relation R and a natural number n ^ 1, we define the 
relation R n as R o ■ ■ ■ o R (n times R), where o is the classical composition 
operator of binary relations: S oT = 7ri i4 cr 2= 3(S' x T). Further, define R^ n as 
UiLi R % -> an d define R =n as R n — R^ n ~ l . Note that R^ R \ equals the transitive 
closure of R, and that for n > \R\, R^ n = R^' R K 
Now consider the following 6-ary relation Run: 



We show next that there is an equation whose only solution, given R, 
is Run. This proves the Proposition, because all we then have to do is 
unnest the solution set and project on the middle two columns to get the 
transitive closure. (The only exception is when Run is empty, in which case 
the transitive closure of R is R itself, but this can also easily be tested in the 
nested relational algebra.) 



\R\ 




i=l 



14 



The desired equation expresses the conjunction of the following conditions 
on relation variable X: 

1. For any pair (x 5 , x 6 ) G 7r 5 6 (X), we denote the relation 

{(xi, x 2 , x 3 , x 4 ) I (xi, . . ■ , x 6 ) G X} 

by X(x 5 ,x 6 ), and denote further 

X(x 5 ,x 6 ) := 7r lj2 (X(x 5 ,x 6 )) and 
X(x 5 ,x 6 ) := 7r 3i4 (X(a;5,x 6 )). 

Then for every (x, y) G 7r 5i6 (X), we must have 

(a) X(x,y) = X(x,y) x X(x,y); 

(b) X(x,y)DR; 

(c) X(x, y) = X(x, y) U X(x, y) o R; and 

(d) (x,y) eX(x,y)-X(x,y). 

(e) Furthermore, every pair (a/, ?/) in the latter difference belongs 
to 7r 56 (X), with X(x',y') = X(x,y) (and thus also X(x',y') = 
X(x,y)). 

2. R =2 C 7r 5 6 (X), and for every (x,y) G R =2 , we have = R. 

3. For every (x, y) G 7r 5i6 (X) such that X(x,y) o R — X(x,y) 7^ 0, there 
exists a pair (x',y r ) G 7r 5 6 (X) with X(x',y') = X(x,y). 

4. For every (x, y) G 7r 5)6 (X) such that X(x, y) 7^ -R, there exists a pair 
(x'y) G vr 5 , 6 (X) with X(x',y') = X(x,y). 

The conjunction of the above conditions expresses that X equals Run. 
Indeed, by (J2J), (HcJ) and JTaJ) we know that R^ 1 x R^ 2 x i?= 2 C X. By in- 
duction andbv pi. (fTcjl . (flejl and (pp) we know that i?^ x x i? =m C 
X and hence finn C X. Moreover, for every (x,y) G R =l+1 we have 
{(ari, yi,x 2 , y 2 ) I (xi,y 1 ,x 2 ,y 2 ,x,y) e X} = R^ 1 x On the other 
hand, if (x,y) G 7r 5 6 (X) then X(x,y) = R, in which case (x, y) G R =2 by 
(fTcj) and (JTHJ), or X(x,?/) 7^ i?, in which case we know by induction and by 

and (jlb|) that (x, y) G R =l+l for some z. This proves X = Run. □ 
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Remark 7.3. The equation constructed in the above proof is not only sparse, 
it is unambiguous: it has a unique solution on each input database. Moreover, 
the same proof works more generally for any fixpoint query |AHV95j on flat 
databases. The only difficulty is that fixpoint queries start from the empty 
relation, while in our proof of Proposition 17.21 we start from R, but that is 
easily dealt with. As already explained in the Introduction, we thus basically 
rediscovered an earlier result by Kolaitis to the effect that every fixpoint query 
is implicitly definable in first-order logic [Kol90 . But note the directness of 
our proof, straightforwardly specifying the run of the fixpoint computation in 
an unambiguous way. The original proof (also presented by Ebbinghaus and 
Flum jEF95j ) is a bit more roundabout, specifying the "stage comparison" 
relation instead. □ 
Another, perhaps a bit frivolous, example of a query that is expressible us- 
ing sparse equations but not using sparse powerset expressions is the nesting 
operator v. It is easy to express v in the powerset algebra using the powerset 
operator and the other operators, but not v itself; so v is not primitive in 
the powerset algebra. As a consequence (Proposition I4.4|) . v is not primitive 
in the equation algebra either. We next observe that when we restrict to 
sparse expressions, nesting remains imprimitive in the equation algebra, but 
becomes primitive again in the powerset algebra. 

Proposition 7.4. Nesting is not expressible by a sparse powerset expression 
without using the v operator itself. 

Proof. Suppose we want to express nesting of a flat binary relation R. The 
first application of the powerset operator is to the result of a flat relational 
algebra expression e applied to R. Let us focus on the case where R is 
the identity relation on a finite domain of n elements. A straightforward 
argument by structural induction shows that, on identity relations, every 
relational algebra expression is equivalent to a finite disjunction of equality 
types. Here, an equality type is a maximally consistent conjunction of equal- 
ities %i = Xj and non-equalities Xi ^ Xj over the variables x\, . . . ,Xk, where 
k is the output arity of e. We thus see that either e(R) is empty on all such 
R (this is when the disjunction is empty), or e(R) is of size at least n when 
n is at least k. In the empty case, the powerset operator is useless, and 
we continue to the next application of powerset. Otherwise, the powerset 
operator explodes and the overall expression is not sparse. □ 

Proposition 7.5. Nesting is expressible by a sparse equation expression with- 
out using the v operator itself. 
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Proof. Let R be a relation name of type (0, 0), and let X and F be relation 
variables of type (0). We can write a relational algebra expression e such 
that on any database C over {R, X, F}, e(C) is empty if and only if X is a 
singleton {x} with x 6 tti(-R), and F = {y | (x,y) G i?}. An example of an 
e that works is: (A stands for symmetric difference) 

TTitTi^Pr X X) U (X - TT^i?)) U (F A 7T 3 (7 1=2 (X X i?)) 

Hence, the expression 

l x 1 ({(X,Y)\e = 0}) 

is a sparse equation expression equivalent to z/ 2 (_R). 

The construction for general nesting operations is analogous. □ 

Our final, and main technical, contribution concerns the parity query. 
Suciu and Paredaens showed that the parity of the cardinality of a finite set 
is not expressible by a sparse powerset expression. We show the analogue for 
the equation algebra: 

Proposition 7.6. The parity query is not expressible by a sparse equation 
expression. 

Proof. Suppose, to the contrary, that we have a sparse equation expression 
to express the parity of the cardinality of a finite domain D. We may assume 
that the input schema is empty, i.e., an input database consists of D and 
nothing else. Consider an innermost equation Eq occurring in our expression. 
It may be nested inside other equations E\, . . . , E}., enumerated from the 
inside to the outside. By Remark 13.31 for each j 6 {0, . . . , k}, Ej can be 
written in the form {(X(, . . . , X? ) \ ej ^ 0}, for some ij, where ej is a 

flat relational algebra expression over the flat schema {X{, . . . , Xf.} possibly 
expanded with certain free variables X l m , where / > j and m < ij. 

By our assumption there are at most polynomially many solutions to 
equation E^. For each solution of Ek there are at most polynomially 
many solutions A k _i of e k -i and so on. A sequence A = (A k , . . . , A ) of 
databases is a solution vector for E , if A k is a solution for E k , and each Aj, 
j < k is a solution for Ej, given A k , . . . , Aj + \. 

Now let A = (A k , . . . , Aq) be a solution vector for Eq, given an input D 
of size n. Then for every permutation / of D, f(A) = (f(A k ), . . . , f(A )) is 
also a solution vector for Eq. The number of different such f(A) is precisely 
n!/|Aut(A)|, where Aut(A) is the group of automorphisms of A. Since all 
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equations are supposed to be sparse, this number is at most n for some 
fixed £, or, equivalently, |Aut(A)| ^ n\/n l . Putting k = £ + 1, this implies 
|Aut(A)| ^ {n — k)\ for sufficiently large n. 

We thus need to know more about large permutation groups. The fol- 
lowing crucial lemma will give us the information we need. The group of 
permutations of a finite set D is denoted by Sym(D), and its alternating 
subgroup of even permutations by A\t(D). If G is a subgroup of Sym(D), a 
fixed set for G is a subset A C D, such that every g G G maps A to A. The 
action of G on a fixed set A (as a subgroup of Sym(A)) is denoted by G A . 

Lemma 7.7. Let k be a fixed natural number. Let G be a subgroup of 
Sym(D), \D\ = n, n sufficiently large. Then \G\ ^ (n — k)\ implies the 
existence of a fixed set A with |A| ^ n — k, such that G A contains Alt(A). 

Proof of Lemma \l. 7| For background on finite permutation groups, we refer 
to Wielandt's book |Wie64j . but here are a few preliminaries. An orbit of 
a permutation group G on a set D is a set of the form {g(x) \ g G G}, for 
some x G D. We call G transitive if D is one single orbit of G. Further, G is 
called primitive if it is transitive and has no nontrivial blocks. Here, a block 
of G is a subset A C D such that for all g G G, the set g(A) is either equal 
to A, or disjoint from it. Trivial blocks are 0, D, and the singletons. If G is 
not primitive but transitive, there is always a complete block system which 
partions D in equal-sized non-trivial blocks. We recall: 

Bochert's Theorem (1889). Let G be primitive on D, not containing 
Alt(L>). Let \D\ = n. Then \G\ ^ n\/\n/2]\. 

For the proof of the Lemma, first assume that G is transitive. There are 
two possibilities: 

1. G is imprimitive with, say, b blocks of size a (a > 1, b > 1, ab — n). 
Then |G] ^ 6! (a!) 6 , which in turn is at most 2(|_n/2j !) 2 for n sufficiently 
large. Thus, by what is given about \G\, 



However, this is impossible for n sufficiently large. 

2. G is primitive. Then, unless G contains Alt(D) (in which case the 
Lemma is proved), by Bochert's theorem, 



(n-k)\ ^ 2([n/2j!) 2 . 



(1) 



(n - fc)! ^ 



\n/2]V 



(2) 
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Again, this is impossible for n sufficiently large. 

Now assume G is intransitive. Let A be an orbit of G of maximal size; 
let £ ^ 1 be such that the size of A equals n — £. We have \G\ ^ (n — £)\ l\. 
Suppose £ > k. Then (n — £)\£\ reaches its maximum at £ — k + 1. Hence, 
(n - k)\ < \G\ ^ (n - k - 1)1 (k + 1)! and thus n - k ^ (k + 1)! which is 
impossible for large enough n. 

So, £ ^ k, or in other words, the size of A is at least n — k. We have 
\G\ < £! |G A | ^ fc! |G A |, so |G A | ^ (n-fc)!/fc!. By definition G A is transitive 
on A. We can now apply the same arguments as in the case U G is transitive" 
above, for G A instead of G, and get that G A must contain Alt (A). Indeed, in 
the right-hand sides of inequalities Q and |2l n now becomes n — £, which has 
for effect that the upper bounds become smaller. Hence, as these inequalities 
already were impossible, they now become even more impossible. The extra 
factor of l/k\ in the left-hand sides does not have a significant influence. □ 

Invoking this Lemma for G = Aut(A), we get a fixed set A of size at 
least n — k such that any even permutation of A can be extended to an 
automorphism of A. 

Now let X be one of the relation variables of an equation Ej, of arity, 
say, r, and consider any r-tuple t whose components are either the symbol * 
or are in D — A. Let r' be the number of components that are the symbol *. 
Further, let £ be an equality type of r'-tuples. Denote by Neighbors x(t,0 
the set of r'-tuples over A of equality type £ such that, if we replace the 
^-components of t by the components of the r'-tuple (from left to right), we 
get a tuple in X A . 

Claim 7.8. Neighbors x (t, £) is either empty, or consists of all r'-tuples over 
A of equality type £. 

Proof of Claim [7T5| Suppose to the contrary that N := Neighbors x (t,!;) is 
neither empty nor full. Take h± in N, and take hi (of arity r' and of type £) 
not in N. Take two arbitrary elements from A that neither appear in h% nor 
in h 2 , and remove them from A, resulting in A'. Take a third tuple h^ (of 
the right arity and type) over A', and disjoint from hi and h%. If h 3 is in N, 
initialize the set / to {/i 2 ,/i3}; otherwise, put / := {hi,h 3 }. Now complete 
I to a maximal set of pairwise disjoint r'-tuples over A' of equality type £. 
There are at least (n — k — 2)/r' tuples in I. 

Assume at least half of I is outside N; denote the set of these by /'. (The 
case where at least half of I is in N is symmetric.) Fix an h G / PI N. For 
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each tuple s in I', we consider the permutation w s that transposes s and 
h and leaves everything else fixed. If w s happens to be odd, we make it 
even by adding the transposition of the two dummy elements we took out 
of A (when we defined A'). Then each set w s {N) contains s, but does not 
contain any other tuples from I'. Thus, we produce in this way at least 
/(n) := ((n — k — 2)/2r' different sets of r'-tuples over A. Since they are 
even, each w s can be extended to an automorphism. Hence, each of the 
f(n) sets must be the Neighbors^ {f ,£,) of some t'. However, there are less 
than (k + l) r different possibilities for t f , while f(n) is larger than that for n 
sufficiently large. So we get to the desired contradiction. □ 

We call (t,£) an r-ary pattern. If Neighbors ^(t, £) is nonempty (and thus 
full), we say that the pattern is instantiated in X A . Note also that the 
extreme cases, where t consists exclusively of stars or where t has no star at 
all, are also allowed and make sense. 

By the above, we thus see that any solution vector A can be generated 
by the following non-deterministic procedure: 

1. Initialize all relations of A to empty. 

2. Choose at most k different elements from D, playing the role of the 
elements outside A. 

3. For every relation variable X (of arity r, say), run through all r-ary 
patterns, and for each of them, non-deterministically instantiate it in 
X A , or not. 

Since k and the number of relation variables are fixed, the number of possi- 
ble patterns is also fixed. Hence, we can write an expression in the nested 
relational algebra that, given D, constructs the set of all possible vectors A 
of the above non-deterministic procedure. This set is a superset of the actual 
set of solution vectors for E . Equation E can now be replaced by a nested 
relational algebra expression which (1) constructs the set of solution candi- 
dates A, (2) projects out the relations for . . . ,Xf o and (3) selects those 
relations which fulfil Eq. The latter is an easy task for the nested relational 
algebra (OW]. 

Hence, we can get rid of E . Repeating this process, we can get rid of all 
equations, so that in the end we are left with a standard nested relational al- 
gebra expression for the parity query. But this is well known to be impossible 
IAHV951IFW92] . □ 
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Remark 7.9. Note that we have actually shown that over the empty schema, 
where databases consist of a finite domain and nothing else, the sparse equa- 
tion algebra is no more powerful than the standard relational algebra. As a 
matter of fact, the proof can easily be generalized to apply also to schemas 
having only relation names of type (0). □ 
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